Copy Number Variants and Segmental Duplications Show Different Formation Signatures

نویسندگان

  • Philip M. Kim
  • Jan O. Korbel
  • Xueying Chen
  • Mark B. Gerstein
چکیده

In addition to variation in terms of single nucleotide polymorphisms (SNPs), whole regions ranging from several kilobases up to a megabase in length differ in copy number among individuals. These differences are referred to as Copy Number Variants (CNVs) and extensive mapping of these is underway. Recent studies have highlighted their great prevalence in the human genome. Segmental Duplications (SDs) are long (>1kb) stretches of duplicated DNA with high sequence identity. They are generally thought to be the result of CNVs reaching fixation in the population. To elucidate likely mechanisms of formation of these features, we examine in detail patterns of co-occurrence of different genomic features with both CNVs and SDs. First, we analyzed the co-localization of SDs and find that SDs are significantly co-localized with each other, resulting in a power-law distribution, which suggests a preferential attachment mechanism, i.e. existing SDs are likely to be involved in creating new ones nearby. This finding is further bolstered by an observation that young SDs most strongly co-localize with SDs that are closest in age and less strongly with older SDs. In line with this, we observe significant association of CNVs with SDs, but this association is weaker than may be expected. Second, we look at the relationship of CNVs/SDs with various types of repeats. We we find that the previously recognized association of SDs with Alu elements is significantly stronger for older SDs and is sharply decreasing for younger ones. While it might be expected that the patterns should be similar for SDs and CNVs, we find, surprisingly, no association of CNVs with Alu elements. This trend is consistent with the decreasing correlation between Alu elements and younger SDs – the activity of Alu elements has been decreasing and by now it they seem no longer active. Furthermore, we find a striking association of SDs with processed pseudogenes suggesting that they may also have mediated SD formation. In line with the trend for Alu elements, this association is decreasing for recent SDs and is quite weak for CNVs. Moreover, find strong association with microsatellites for both SDs and CNVs that suggests a role for satellites in the formation of both. Finally, a manual analysis of a small number of CNV breakpoints is suggestive of an alternative mechanism. In summary, we find striking differences in formation signatures of CNVs and SDs that can partly be explained by a decrease in Alu activity. …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The genomic architecture of segmental duplications and associated copy number variants in dogs.

Structural variation is an important and abundant source of genetic and phenotypic variation. Here we describe the first systematic and genome-wide analysis of segmental duplications and associated copy number variants (CNVs) in the modern domesticated dog, Canis familiaris, which exhibits considerable morphological, physiological, and behavioral variation. Through computational analyses of the...

متن کامل

Hotspots for copy number variation in chimpanzees and humans.

Copy number variation is surprisingly common among humans and can be involved in phenotypic diversity and variable susceptibility to complex diseases, but little is known of the extent of copy number variation in nonhuman primates. We have used two array-based comparative genomic hybridization platforms to identify a total of 355 copy number variants (CNVs) in the genomes of 20 wild-born chimpa...

متن کامل

Analysis of the multi-copy gene family FAM90A as a copy number variant in different ethnic backgrounds.

Copy number variants contribute extensively to inter-individual genomic differences, but little is known about their inter-population variability and diversity. In a previous study (Bosch et al., 2007; 16:2572-2582), we reported that the primate-specific gene family FAM90A, which accounts for as many as 25 members in the human reference assembly, has expanded the number of FAM90A clusters acros...

متن کامل

Replication stress induces genome-wide copy number changes in human cells that resemble polymorphic and pathogenic variants.

Copy number variants (CNVs) are an important component of genomic variation in humans and other mammals. Similar de novo deletions and duplications, or copy number changes (CNCs), are now known to be a major cause of genetic and developmental disorders and to arise somatically in many cancers. A major mechanism leading to both CNVs and disease-associated CNCs is meiotic unequal crossing over, o...

متن کامل

A novel mechanistic spectrum underlies glaucoma-associated chromosome 6p25 copy number variation.

The factors that mediate chromosomal rearrangement remain incompletely defined. Among regions prone to structural variant formation, chromosome 6p25 is one of the few in which disease-associated segmental duplications and segmental deletions have been identified, primarily through gene dosage attributable ocular phenotypes. Using array comparative genome hybridization, we studied ten 6p25 dupli...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007